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The Mobius domain wall action [[l|] is a generalization of Shamir's action, which gives exactly 
the same overlap fermion lattice action as the separation (L s ) between the domain walls is taken 
to infinity. The performance advantages of the algorithm are presented for small ensembles of 
quenched, full QCD domain wall and Gap domain wall lattices [Qj. In particular, it is shown that 
at the larger lattice spacings relevant to current dynamical simulations Mobius fermions work 
well together with GapDWF, reducing L s by more than a factor of two. It is noted that there is a 
precise map between the domain wall and effective overlap action at finite quark mass including 
finite L s chiral violations so that the Ward-Takahashi identities for the axial and vector currents 
are exactly equivalent in the two formulations. 
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1. Introduction 

Domain wall fermions provide an efficient and rigorous implementation of chiral symmetry in 
lattice field theory at finite lattice spacing. Following the original ideas of Kaplan and Shamir one 
introduces two 4-d domain walls (or 3 branes) separated by L s lattice sites in a 5th dimension. The 
5-d domain wall action, 

Sow = £[y x A D Dw(mm XiS + <S> Xi s(DDw(l)<l>hA , (1-1) 

x,s 

contains 5-d Wilson fermion (*F A , S ) and Pauli-Villars pseudo-fermion (& X)S ) fields which enjoy a 
"kinematical" super symmetry broken only by the boundary conditions on the 3-branes (see Fig. [[]). 
The result is an effective 4-d action with low mass states below the cut-off representing a left(right) 
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Figure 1: Domain wall convention with left/right chiral mode at s = 1 and s = L s respectively approximated 
by 3-branes separated by a distance a$L s . The Pauli Villars (m = 1) and the zero mass Dirac (m = 0) operators 
obey anti-periodic and Dirichlet boundary conditions respectively. 



chiral fermion on each wall at s = l(L s ) respectively in the limit of infinite separation between 
the walls. After a long and interesting history of competing methods, it is now realized that the 
effective 4-d theory at L s = oo is equivalent to one based on Neuberger's overlap operator, with 



'l+m 1 —m 



xy 



¥ (1-2) 



and an appropriate Dirac "Hamiltonian" in the sign function s[H]. The two actions lead to equiva- 
lent matrix elements D~ v l iXy (m) = (¥x¥ y )ov = {qxq y )DW, where the fields q,q, shown in Fig. [TJ, are 
mathematically defined in Sec. [|. 

For zero mass quarks, D ov (0) obeys the Ginsparg -Wilson relation, 

7 5 D m ,(0) +D ov (0)y 5 = 2D OV (0)7 5 D OV (0) , (1.3) 

or equivalently the anti-commutator, {y5,£)~ v (0)} = 2/5, which guarantees exact chiral symmetry 
at finite lattice spacing and zero quark mass, m.f = m/(l — m). Since all implementations give 
solutions to the GW relation, the debate on the virtues of overlap vs. domain wall fermions is 
essentially algorithmic in nature. For domain wall algorithms, practical considerations demand that 



2 



Mobius Algorithm for Domain Wall and GapDW Fe rmions 



Richard C. Brower 



the L s — ► oo limit be approximated by modest values: L s = O(10). This is not a trivial requirement. 
Finite L s causes a residual breaking to the GW relation as measured by the difference operator, 

2y 5 A Ls [H] = y 5 D ov (0)+D ov (0)y 5 - 2D ov (0)y 5 D ov (0) . (1.4) 

The conventional criterion for estimating this violation of chiral symmetry is to compare the 
magnitude of "residual mass", 

Tr[Dt- l D m }} 

relative to the explicit quark mass mj = m/(l—m). As emphasized by Sharpe [|[], at current lattices 
spacings a residual mass 0(10 3 ) is adequate but some quantities require an order of magnitude 
smaller residual mass. Moreover recent applications to finite temperature and JV = 1 SUSY QCD 
have required much larger values of L s = 0(100). Better methods are needed to reduce m res at 
reasonable values of L s . 

Within this framework there still remains a large space of options for the lattice partition 
function, 

Z[U] = J @y@yeP Tr ^ Up } +Sim P rmed [ U } +¥ D ov(m)[U]\if (L6) 

by improving the gauge action and/or the approximation to the overlap operator. Here we re- 
examine the Mobius formulation of the domain wall algorithm [111], verifying that very substantial 
improvements can be made in the convergence rate to the exact chiral fermion at L s = oo. In addition 
there is a strong feedback between improved gauge and fermionic algorithms. In particular we point 
out when the so called "Gap domain wall" modification of the gauge action combines nicely with 
the Mobius fermion action to give multiplicative improvements — each reducing the residual mass 
by separate orders of magnitude. 

2. Mobius Recipe 

The simplest kernel for the overlap algorithm is the Wilson Hamiltonian operator, H = y 5 D w,lson (Ms), 
where 



D™ lson {M 5 ) = (4+M 5 )4,v - \ (1 - y»)u x ^8 x+M + (l + y^ul + , <l v 



(2.1) 



with a negative mass parameter M$ € [—1,-2]. However for the domain wall algorithm, the sim- 
plest implementation is the Shamir form, H = y=,D sham,r (M$) , 

n Skamir (M , _ gsP^Ms) 

° {M5) ~ 2 + a 5 D^n [M5) ■ ( 2 - 2 ) 

The Mobius form is a real 3 parameter Mobius transform of the Wilson kernel interpolating be- 
tween both of these, 

D Moebius (M 5 ) = ( b 5+C 5 )D W ' ls "' 1 (M 5 ) aD Shamir, M j (23) 
v J 2 + (b 5 -c 5 )D wdson (M 5 ) y ' 
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Relative to Shamir's kernel this introduces 1 new "scaling" parameter a = (bs + C5) / a$ at fixed 
a$ = Z?5 — C5. Due to the scale invariance of the sign function, e[aA] = e[A], this does not change 
the L s = 00 chiral lattice action. Consequently at fixed 05, M$ the Mobius rescaling should be 
regarded as an improved algorithm for the same chiral action, optimized by choosing a(L s ) 
to minimize m res at finite L s . 

Why is this freedom to rescale H desirable? The difficulty with the domain wall approach is 
that the resultant polar approximation to the sign function, 



(1+H) L *-(1-H) L * 
(1+H) L ' + (1-H) L ' 



tanh[-(V2)lo g r] , 



(2.4) 



is exponentially convergent only for eigenvalues A of H inside the interval: log(|A|) € [l/L s ,L s ]. 
So the advantage of rescaling at finite L s is to use this interval more efficiently by shifting the 
spectrum log (A) — ► log (a) +log(A). Other approaches to improving the polar approximation that 
have been suggested include explicit projection of a finite set of eigenvalues at small A and/or 
suppressing the number of small eigenvalues by changing the gauge action. Indeed these may be 
combined together with Mobius fermions to gain additional advantage as illustrated here by the 
Mobius rescaling of the GapDW lattices. Next we explain how the domain wall implementation of 
this rescaling naturally involves the two parameters, bs,c$. 

The Mobius generalization of Shamir merely requires that the Wilson kernel be included in 
the 5th dimensional hopping term, 



D uw (m) s ,, = D™P + SsJ+D™ P- V- 

- mD W P + 5 Stl 8^ s - mD^P- S.j.S,.: 



(2.5) 



with P± = ±(1 ± y 5 ) and D$ = b 5 (s)D Wilson (M 5 ) + 1, D w = c 5 (s)D WUson {M 5 ) - 1 with s,s' 
1 , 2 , • ■ • L s or in L s x L s matrix notation. For the rescaling example discussed we take bs (s) + C5 (s) = 
ocas , Z75 (s) — C5 (s) = a$ , so the s-dependence for D^) can be dropped, however we have included 
it so that the Mobius class includes other approaches such as the Zolotarev approximation or the 
variable fields suggested by Bar, Narayanan, Neuberger and Witzel In matrix notation: 

-mD (1) P+ 














dL 2) p + 




D (2) p _ 


D DW (m) = 





dL 3) p + 


D® 




-mD ( t s) P- 









D 








(2.6) 



3. Performance Measures 



Fortunately the new off diagonal Wilson operators in the Mobius domain wall action can be 
implemented with essentially no additional algorithmic complexity. The first step, suggested in 
Ref [|I|], is to replace 5-d red/black preconditioning by a 4-d checker board with no alternation of 
color along the 5th axis. The new form of the Schur complement solves analytically all interaction 
in the fifth dimension. The performance of 4-d versus 5-d red/black preconditioning, if anything, 
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favors this construction. Then with a simple gather operation for the three spinors in the jx direc- 
tion, the number of Wilson Dirac application per CG iteration is identical. A full comparison of 
performance on a range of lattices is impossible in this short talk. So we consider three examples 
with more to be presented in a future publication. 
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Number of Dirac applications 
Figure 2: The Mobius algorithm on pure gauge lattices compared with Shamir (a = 1). 

Quenched Lattices: As demonstrated in the original proposal the Mobius formulation has 
the potential of an order of magnitude reduction of the explicit chiral symmetry breaking at fixed 
computational cost. A comparison of the residual masses is given in Fig. ||]on j8 = 6.0 quenched 
lattices with aT^ ~ 2.lGeV. For these lattices the optimal rescaling satisfies the empirical form, 
a(L s ) ~ 1 +L s /8. Note that for small residual masses the advantage of scaling is huge. 
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Figure 3: The Mobius algorithm on full QCD domain wall lattices compared to Shamir (a = 1). 

Domain Wall Lattices: Test were also carried out on full domain wall lattices which generally 
exhibit worse convergence to small m res . However as an example in Fig. ||the same comparison is 
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made on a set of DWF Nf = 2 + 1 Iwasaki lattices [Q] at j3 = 2.13 with m s = 0M,mi/m s = 1/4, 
aT x ~ l.lGeV . Even without carefully tuning the rescaling parameter the advantage appears to be 
nearly as dramatic as for the quenched lattices. More thorough studies of this are underway. 
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Figure 4: The Mobius algorithm (a = 2) on pure gauge and Gap lattices vs Shamir (a = 1). 

Gapped Lattices: A more radical suggestion to suppress small eigenvalues in H was sug- 
gested by Vranas [Q] which by adding a pair of Wilson fermions with mass M5 < induces an 
improvement term, 



SaapPnix)] = Tr[\og{rt mhon {M 5 )D Wllson {M 5 ))] 



(3.1) 



for the effective gauge action. To test the locality of this action one should recognize that the bilocal 
dependence on the gauge field is measured by "axial correlator" 



8u,(x)$u v (y) s Ga P [U] = (j 5 G ap (x)fJ ap (y)) ~exp[-|x-y|/§» 



(3.2) 



whose long distance tail is the would be "pion" propagator for the Gap fermions. Ref. [||] demon- 
strated that this correlation is 0(1) in lattice units as you approach the continuum limit, roughly the 
same degree of locality in the overlap action itself. Fig. ^ compares (a = 2) vs Shamir (a = 1) at 
M5 = — 1.8, m = 0.02 for pure gauge and Gapped lattices both with cT x = lAGev. 



4. Discussion 

Figs. ||- ^| point to a general rule that a Mobius scaling transformation of Shamir by a factor of 
a > 1 allows one to reduce L s and therefore the cost by a factor of 1 /a at fixed m res . This is a natural 
consequence of the scaling relation, A^faA] = A^JA], for small eigenvalues, |A| < 0(\/L s ). 
Tuning a > 2 for large L s gives additional savings. There appears to be an additional computational 
cost for a > 2 on the order of 10% due to increased condition number. Alternatively we note at 
fixed L s = O(10), the Mobius algorithm can reduce m res by an order of magnitude or more at fixed 
cost. The combined improvement in the Mobius algorithm on Gapped lattices is multiplicative so 
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here one might contemplate exploratory studies in beyond the standard model strong dynamics with 
L s in the range of 4 to 8. For high precision studies m res can be reduced below 10~ 5 for L s = 32. 
Both of these are attractive options. 

In a subsequent publication more details on the efficiency and formal properties of the Mobius 
algorithm will be provided. We simply note here that a straight forward general formalism exists 
that allows all correlators as well as the Ward-Takahashi identities to be expressed independent 
of the detailed form of the Mobius domain wall action. For example the application of LDU 
decomposition leads to the basic identity, 



D DW {\)0>} S , S 



D DW i 



m 



m)Ap^{m) 





1 



_(\-m)A R L p m } 

where & s \ s = P-S s i s +P + 5y i>s+ i rotates the Right fermion to the Left wall in Fig. It follows 
from this that 

D ov,xy( m ) = (YxWy)ov = (flrtfyW , (4-2) 

defining the appropriate chiral domain wall fields on the boundary: q x = [^^jx.i and q x = 
[^Dowi^^x,! ■ All the matrix elements are directly related to the transfer matrix, T = (1 — 
H)/(\+H), through the partial left and right products 1 ,A^ = T~ s / (\ + T~ Ls ) and Af +1 = T s ~ Ls /{\ + 
T~ Ls ). Together they give the GW chiral breaking operator, Al s [H] = AjAj, x , defined above in 
Eq. |Q| . Similar arguments leads to a general map between all overlap and domain wall correla- 
tors, with only implicit reference to the Mobius operator Dpy/im). From this map, the vector and 
axial Ward-Takahashi identities must be identical for both overlap and domain wall actions at finite 
L s , lattice spacing and finite volume. 

In summary little change in the formalism or software is required to use the Mobius algorithm, 
while providing a substantial improvement in performance. A very efficient code for the BlueGene 
has been written by Andrew Polchinski under the SciDAC software project and is readily available 
at the software links for USQCD: http://usqcd.org. 

This work was supported in part by US DOE grant DE-FG02-91ER40676, NSF grant DGE- 
0221680, NSF CCF-0728915, and the Jeffress Memorial Trust grant J-813. 
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For simplicity we have dropped the s-independence of transfer matrix. The general case requires s-ordered product. 
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